Correlated Components Analysis - Extracting Reliable Dimensions in Multivariate Data
نویسندگان
چکیده
How does one find data dimensions that are reliably expressed across repetitions? For example, in neuroscience one may want to identify combinations of brain signals that are reliably activated across multiple trials or subjects. For a clinical assessment with multiple ratings, one may want to identify an aggregate score that is reliably reproduced across raters. The approach proposed here — “correlated components analysis” — is to identify components that maximally correlate between repetitions (e.g. trials, subjects, raters). This can be expressed as the maximization of the ratio of between-repetition to within-repetition covariance, resulting in a generalized eigenvalue problem. We show that covariances can be computed efficiently without explicitly considering all pairs of repetitions, that the result is equivalent to multi-class linear discriminant analysis for unbiased signals, and that the approach also maximize reliability, defined as the mean divided by the deviation across repetitions. We also extend the method to non-linear components using kernels, discuss regularization to improve numerical stability, present parametric and non-parametric tests to establish statistical significance, and provide code.
منابع مشابه
Principal Component Analysis, A Powerful Unsupervised Learning Technique
Data mining is a collection of analytical techniques to uncover new trends and patterns in massive databases. These data mining techniques stress visualization to thoroughly study the structure of data and to check the validity of the statistical model fit which leads to proactive decision making. Principal component analysis (PCA) is one of the unsupervised data mining tools used to reduce dim...
متن کاملRelationship between Yield and its Component in Soybean Genotypes (Glycine Max L.) using Multivariate Statistical Methods
18 soybean genotypes were examined to investigate the relationships between some principal attributions of morphology with seed yield per soybean, by Random Complete Block Design (RCBD) study. This study was also carried out three replicates to gain reliable results. The results of variance analysis indicated that, there were significance differences among all soybean genotypes. Moreover, the r...
متن کاملModelling of Correlated Ordinal Responses, by Using Multivariate Skew Probit with Different Types of Variance Covariance Structures
In this paper, a multivariate fundamental skew probit (MFSP) model is used to model correlated ordinal responses which are constructed from the multivariate fundamental skew normal (MFSN) distribution originate to the greater flexibility of MFSN. To achieve an appropriate VC structure for reaching reliable statistical inferences, many types of variance covariance (VC) structures are considered ...
متن کاملMultivariate Statistical Analysis Decision-making Hybrid Method for Road Traffic Safety Evaluation in Iran
Obviously, improving the road safety and the efficient allocation of limited resources to the provinces according to their ranking should be done. This paper presents a hybrid method of multivariate statistical analysis-decision making to evaluate Iran road traffic safety. In order to solve the problems of road traffic safety, a macroscopic evaluation and traffic safety level classification in ...
متن کاملAttachment styles and emotional intelligence components: the predictors of health dimensions
Health, as one of the most important sources of comfort in life, is the complete physical, mental and social well-being, while there are dynamic mutual relationships among the three components. This study was aimed to investigate the role of attachment styles and emotional intelligence components in the prediction of health dimensions. The statistical population was consisted 160 parents who pa...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1801.08881 شماره
صفحات -
تاریخ انتشار 2018